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Abstract 

We survey recent results about asymptotic functions of groups, obtained by the au- 
thors in collaboration with J.-C.Birget, V. Guba and E. Rips. We also discuss methods 
used in the proofs of these results. 
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1 Results 
1.1 Definitions 

Recall that isoperimetric functions of a finitely presented group G = (X \ R) measure areas 
of van Kampen diagrams over the presentation of this group. Figure 1 shows what a van 
Kampen diagram may look like. 

*The research of the first author was supported in part by the Russian fund for fundamental research 
96-01-420. The research of the second author was supported in part by the NSF grant DMS 9623284 
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Fig. 1. 



It is a directed planar labeled graph where every edge is labeled by a generator from 
X, and the contour of every 2-cell (face) is labeled by a relator from R. By van Kampen 
lemma J23| j a word w € XUX^ 1 is equal to 1 in the group G if and only if there exists a van 
Kampen diagram A over the presentation of G with boundary label w. The number of cells 
in A is equal to the number of factors in a representation of uu as a product of conjugates 
of relators from R: 

w = Y[r^. (1) 

We are going to establish a more precise relation between equation ([!]) and the van Kampen 
diagram A later (see Lemma [l] below). 

If a van Kampen diagram has minimal number of cells, m, among all diagrams with 
the same boundary label w then we say that w has area m. A function f{m) is called an 
isoperimetric function of the presentation (X \ R) of the group G if every word of length 
at most m which is equal to 1 in the group has area at most f{m). On the set of functions 
N — > N, one can define a quasi-order saying that / -< g if 

f(m) < Cg(Cm) + Cm 

for all m and some constant C. Any minimal (with respect to -<) isoperimetric function is 
called the Dehn function of the presentation (X \ R). The article "the" is appropriate here 
because it is well known Q , [^] , that Dehn functions of different presentations of the 
same group are equivalent that is they satisfy inequalities 

fx < f-2, h < fi 



We can also define isodiametric functions introduced by Gersten []15[. These functions 
measure the diameter of a van Kampen diagram with given perimeter^. More precisely, 

1 Recall that the diameter of a graph is the maximal distance between two vertices of the graph. 
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with every word w which is equal to 1 in G, we associate its diameter, that is the smallest 
diameter of a van Kampen diagram with boundary label w. Then if d{n) is an isodiametric 
function of G = (X \ R), d(n) must exceed the diameter every word w = 1 (mod G) of 
length < n. The equivalence of isodiametric functions is defined as before. Isodiametric 
functions of different presentations of the same group are always equivalent [18]. 

Both isoperimetric and isodiametric functions reflect the decidability of the word prob- 
lem in the group. In particular, |15|| , the word problem is decidable if and only if the Dehn 
function (the smallest isodiametric function) is recursive. Nevertheless, the word problem 
in a group with huge Dehn function may be easy. For example the word problem in the 
Baumslag-Solitar group (a, b \ a b = a 2 ) can be solved in quadratic time (since this group is 
representable by 2 x 2 integer matrices) while the Dehn function is exponential [10|. One of 
our main goals is to show that still there exists a very close connection between the Dehn 
functions and the computational complexity of the word problem. 

If G = (X) and H = {XUY) are finitely presented, one can also define the area function 
of G. This function is defined on the set W of all words in the alphabet X which are equal 
to 1 in G. It takes every word from W to the area of this word in H. 

Other important concepts are the one of a distortion function and the one of a length 
function. Let G = (X) be a finitely generated subgroup of a finitely generated group 
H = (Y). Then the distortion function dc,H{ n ) ■ N — > N takes every natural number n to 
max{|w|x | u 6 G, \u\y < n}. In other words, in order to compute dQ,H{n) we consider all 
(finitely many) elements of G whose lengths in H are at most n, for each of these elements 
we compute its length in the alphabet X, and then take the maximum of these lengths. 

The corresponding length function of G inside H is the function t : G — > N which takes 
every element g of G to |<?|y, the length of g in H. 

Two functions /i,/2 : G — > N are called O-equivalent if fi(g) < 0/2(5), /2G?) < cfi(g) 
for some constant c and every g £ G. Different choices of generating sets in G and H lead 
to O-equivalent length functions £h ■ G — > N and equivalent distortion functions associated 
with this embedding. 

If \g\x = 0(|<7|y) for every g € G, or, equivalently, if the distortion function is at most 
linear we say that G is quasi-isometrically embedded into H or that G has bounded distortion 
in H. Otherwise we say that G is has unbounded distortion. 

For example, every subgroup of the free group has (obviously) bounded distortion, but 
the (cyclic) center C = (c) of the 3-dimensional Heisenberg group = (a, b, c|| [a, b] = 
c, ca = ac, cb = be) has quadratic distortion. Indeed c n = [a n , b n ] for every n, the length of 
c n in C is n 2 and the length of this element in is < 4n. 

Just as Dehn functions and isodiametric functions reflect the decidability of the word 
problem, the distortion function reflects the decidability of the membership problem for 
subgroups: if G is a finitely generated subgroup of a finitely generated group H which has 
solvable word problem, then the membership in G for elements of H is decidable if and only 
if the distortion function of G in H is recursive [|ll]]. 

In this paper, we survey recent results about isoperimetric, isodiametric, length and area 
functions of groups obtained by the authors in collaboration with J.-C. Birget, V. Guba 
and E. Rips. 
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There are several important connections between Dehn functions and length functions. 
We present two easy statements without proofs here. 

Theorem 1 (Bridson, ffl) Let G be a finitely presented group and H < G be a finitely 
generated subgroup of G with distortion function d(n). Then the Dehn function of the HNN 
extension H = {G, t \h = h, h € H) is at least d(n). 



Theorem 2 (Olshanskii, Sapir 1998) The set of distortion functions of finitely generated 
subgroups of the direct product of two free groups F2 x F2 coincides ( up to equivalence ) with 
the set of all Dehn function of finitely presented groups. 

Theorem |2| is new although a remark in [18| hints to a possibility of some connection 



between distortion of subgroups in F2 x F2 and Dehn functions. Here is a proof of this 



theorem. It uses the well known Mikhailova's trick (see [23|) and a result from Baumslag 
and Roseblade ||. 

It is proved in || that every finitely generated subgroup E of F2 x F2 is the equalizer (in 
H it is called the free corner pullback) of two homomorphisms <fi : E' — > G and ip : E" — > G 
of two finitely generated subgroups E' , E" of F2 onto a finitely presented group G, that is 
E = {(u,v) £ E' x E" I (p(u) = ip(v)}. Since every finitely generated subgroup of F2 has 
bounded distortion, E' x E" is quasi-isometrically embedded into F2 x F2. So it is enough to 
show that the distortion function of the equalizer of two homomorphisms <j) : F m — > G and 
ip : F n — > G of two free groups onto a finitely presented group G in F m x F n is equivalent 
to the Dehn function of G. 

Let d(k) be the Dehn function of G. Let F m = {x±, ...,x m ), F n = (yi,.-,y n ) ( w e 
assume that these generating sets are closed under taking inversese). As a generating set 
for H = F m x F n we take the set of all pairs (xj, 1), (1, yj). Let E be the equalizer of <f> and 
ip in H. 

Let 7*1, r| be generators of the kernel of ?/> (as a normal subgroup of -F n ). Without loss 
of generality we assume that the set {7*1, --^rg} is closed under cyclic shifts and inverses. 

For every i = l,...,m pick one word ti £ F n such that (f)(xi) = ip(ti). For every 
j = 1, n pick one word Sj such that </>(sj) = ip(yj)- Then the equalizer E is generated by 
the pairs (xi,U), (sj,yj), (1, r&), 7 = 1, m, j = 1, n, fc = 1, Indeed, if (it, G E 1 , 
that is 4>(u) = ip{v) and u = Xi 1 Xi 2 ...Xi p then 

(u,u) = (xi 1 ,ti 1 )...(xi p ,t ip )(l,a) 

where ip(a) = 1. Since a belongs to the kernel of if), we have that a is a product of conjugates 
of r k : 

d 

a = IK 1 - 

1=1 

Therefore 

(u,v) = (x, 1 ,t, 1 )...(x v ,t ip )n( 1 '^) K{ ^^ ) - ( 2 ) 

i=l 
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Here w(s) denotes the word w where each yj is substituted by the corresponding Sj. 

We shall prove that the distortion function of E in H is equivalent to d(k). In order to 
do that we need the following general statement. 

Lemma 1 Let A be a van Kamyen diagram over a presentation (X \ R) where X = X^ 1 , 
R is closed under cyclic shifts and inverses. Let w be the boundary label of A. Then w is 
equal in the free group to a word of the form u\r\uiri---iid r 'd^d+i where: 

1. n € R; 

2. uiU2---Ud + i = 1 in the free group; 

3- J2i=i \ u i\ < 4e where e is the number of edges of A. 

Proof. If A has an internal edge (i.e. an edge which belongs to the contours of two 
cells) then it has an internal edge / one of whose vertices belongs to the boundary. Let 
us cut A along / leaving the second vertex of / untouched. We can repeat this operation 
until we get a diagram Ai which does not have internal edges. It is easy to see that the 
boundary label of Ai is equal to w in the free group. The number of edges of Ai which do 
not belong to contours of cells (let us call them edges of type 1 is the same as the number 
of such edges in A and the number of edges which belong to contours of cells in Ai {edges 
of type 2) is at most twice the number of such edges of A (we cut each edge from a contour 
of a cell at most once, after the cut we get two external edges instead of one internal edge). 

Suppose that a cell II in Ai has more than one edge which has a common vertex with 
II but does not belong to the contour of IT . Take any point O on 9(11). Let p be the 
boundary path of Ai starting at O and let q be the boundary path of II starting at O. 
Consider the path qq~ l p. The subpath q~ x p bounds a subdiagram of Ai containing all cells 
but II. Replace the path q in qq~ l p by a loop q' with the same label starting at O and lying 
inside the cell II. Let the region inside q' be a new cell II'. Then the path q'q~ l p bounds 
a diagram whose boundary label free is freely equal to w. Notice that II' has exactly one 
edge having a common vertex with II' and not belonging to the contour of IT. Thus this 
operation reduces the number of cells which have more than one edge which has a common 
vertex the cell but does not belong to the contour of it. 

After a number of such transsformations we shall have a diagram A2 which has the form 
of a tree T with cells hanging like leaves (each has exactly one common vertex with the 
tree) . 

The number of edges of type 1 in A2 cannot be bigger than the number of all edges in 
Ai, so it cannot be more than two times bigger than the total number of edges in A. 

The boundary label of A2 is freely equal to w, and it has the form u\ri x u<ir{^...u ( ]ri A u ( i+\ 
where d is the number of cells in A, u\U2---Ud+\ is the label of a tree, so u\U2---Ud+i = 1 in 
the free group. The sum of lengths of u; L is at most four times the number of edges in A 
because the word u\U2--.Ud+\ is written on the tree T, and when we travel along the tree, 
we pass through each edge twice. 

The lemma is proved. 
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Let us consider the distortion function of E in H. Without loss of generality we can 
assume that d(k) is the Dehn function of the presentation (yi, ...,y n \ r±, ...,77} (recall that 
Dehn functions of different finite presentations of the same group are equivalent). 

Let (u,v) be any element in E whose length in H, \u\ + \v\, is k > 1. Then as before 

(u,v) = (xi 1 ,t il )...(x ip ,t ip )(l,a) 

where u = x^.-.x^, ip(a) = 1. 

Notice that the length of the word a does not exceed 

\v\ + c\\u\ < c\(\u\ + \v\) = c\k 

where c\ is the maximal length of \ti\, i = 1, ...,m. 

Let A be the minimal area van Kampen diagram over the presentation 

(yi, -,Vn I n, -,re) 

of G with the boundary label a. Then the area of A does not exceed d{c\k). Since A is 
a planar graph, its number of edges e does not exceed a constant times the area plus the 
length of the boundary of A. 
By Lemma |], 

a = uir^uz—r^Uq+i 
where u\...u q+ i = 1 in the free group, and \ u i\ — 4e. Then 

(l,a) = (ui(s) ■ 1 • u 2 (s) • 1 • ... • 1 • u q+ i(s), uir il n 2 ...r i9 ii 9+ i) 
= u[ ■ (l,r h ) ■ u' 2 (l,r i2 ) ■ ... ■ {l,r iq ) ■ u' q+1 

where v! i denotes the word U{ with letters yj substituted by (sj,yj). Therefore the length 
in E of the element (l,a) does not exceed C2d{c\k) + c\k for some constant 02- Hence the 
length in E of the element (u,v) does not exceed k(l + c\) + C2d(c%k). 

This implies that the distortion function of E in H does not exceed a function equivalent 
to the Dehn function d. 

To prove that the Dehn function d does not exceed a function equivalent to the distortion 
function of E in H, it is enough, for every number p > 1, to take a word a € F n , from the 
kernel of ip, \a\ < p, of area d(p). Then it is easy to see that any representation of (1, a) as 
a product of generators of E must contain at least d(p) factors of the form (because 
it corresponds to a representation of a in the form uiri 1 v,2ri 2 ...ri d u ( i+i where ui...Ud+i = 1 
in the free group). □ 

1.2 Dehn Functions of Groups 

Our first goal is to give an almost complete description of Dehn functions of finitely presented 
groups in terms of time functions of Turing machines. First of all Birget and Sapir [|33| 
proved that every Dehn function is the time function of a nondeterministic Turing machine. 
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Theorem 3 (Birget, Sapir, jffi^j) Every Dehn function of a finitely presented group G is 
equivalent to the time function of some (not necessarily deterministic) Turing machine 
solving the word problem in G. 

This result restricts the class of functions which can be Dehn functions of groups. Indeed, 
time functions of non-deterministic machines are functions f(n) which can be computed 
deterministically in time at most 2^ n \ It is easy to construct a recursive number a > 2 
such that the function [n a ] is not computable even in double exponential time by a Turing 
machine, so n a is not equivalent to the Dehn function of any finitely presented group. 
This answers a question by Gersten (he asked if every increasing recursive function > n 2 is 
equivalent to the Dehn function of a finitely presented group) . 

The set of Dehn functions "must" satisfy a yet another restriction: every Dehn func- 
tion "must" be superadditive (more precisely, it "must" be equivalent to a superadditive 
function), that is f(m + n) > f(m) + f(n) for every m,n. We put the word "must" in 
quotation marks because the proof of this restriction is yet to exist. Here is a quasi-proof. 
Notice that it is enough to show that f(m + n + c) > f(m) + f{n) for some constant c and 
all m,n (since we identify equivalent functions). Now, if the word w of length < m has 
area f(m) and the word w' of length < n has area f(n) and there are no cancellations in 
the product w 9 w' where g is a word of small length (< c/2), then this product "cannot" 
have area smaller than f(m) + f{m). Since the length of w 9 w' is < m + n + c, we have 
f(m + n + c) > f(m) + f(n). Figure 2 shows a diagram with boundary label w 9 w' . 




Fig. 2. 

Of course the problem is that we can probably tessellate the disk with the boundary 
label w 9 w' in a different, more economical, way. Still there are so many ways to choose 
g, and to connect two van Kampen diagrams that it seems unlikely that we cannot find a 
product w 9 w' with area f(m) + f(n). 

Although, as we have said the proof of superadditivity property does not exist at that 
time, Guba and Sapir were able to prove the following partial result. 

Theorem 4 ( Guba, Sapir, fftfy) The Dehn function of every group which is a free product 
of two non-trivial groups, is superadditive. 
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The proof of this theorem basically shows that the idea presented above works in the 
case of free products. 

In view of Theorem ||, the superadditivity property is equivalent to the following prop- 
erty: 

The Dehn function of any finitely presented group G is equivalent to the 
Dehn function of the free product G * Z. 

The next theorem gives a description of Dehn functions. It shows that the class of 
functions f(n) > n 4 satisfying restrictions mentioned above virtually coincides with the 
class of Dehn functions > n 4 of finitely presented groups. 

Theorem 5 (Sapir, Birget, Rips, W^) Let M be a not necessarrily deterministic Turing 
machine with time function T(n) for which T(n) 4 is superadditive. Then there exists a 
finitely presented group G(M) = (A) with Dehn function equivalent to T(n) 4 , and the 
smallest isodiametric function equivalent to T(n) 3 . 

Moreover, G(M) simulates M, that is there exists an injective map K from the set of 
input words of M to (A U A^) + such that 

1. 1/C\u\ < | If (it) | < C\u\ for some constant C > 1 and for every input word u; 

2. An input word u is accepted by M if and only if K(u) = 1 in G; 

This theorem implies the following description of the "isoperimetric spectrum" in [4, oo), 
that is the numbers a > 4 such that n a is equivalent to a Dehn function of a finitely 
presented group. 

We say that a real number a is computable in time < T{m) for some function T(m) if 
there exists a deterministic Turing machine which for every number m written in binary 
computes the first m digits of a in time at most T{m). 

Theorem 6 (Sapir, MBJ) For every real number a > 4 computable in time ^ 2 2 " 1 the 
function n a is equivalent to the Dehn function of a finitely presented group and the smallest 
isodiametric function of this group is n 3 / 4a . On the other hand if n a is the Dehn function 
of a finitely presented group then a is computable in time ■< 2 

Of course all well known numbers > 4 (say, rational numbers, e + 2,e7r,21og 2 ^ for 
integers a, b, a > 46), are computable in polynomial time, so for these numbers a, n a is 
the Dehn function of a finitely presented group. For a < 4, Brady and Bridson proved 
that the spectrum contains all numbers of the form 2 log 2 tt where a > b are integers, so 
the spectrum is dense in the set of all real numbers, but a description similar to Theorem 
P is not known for numbers < 4. Even for non-integer rational numbers between 2 and 4 
we do not yet know if they belong to the isoperimetric spectrum. We expect the result for 
a G [2, 4) to be similar to Theorem || 

Of course Theorem || provides examples of Dehn functions which are much more compli- 
cated than n a . For example, functions like n 27r (log n) logn log log n are clearly equivalent to 
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fourth powers of time functions of Turing machines (hint: take the Turing machines which 
calculates the fourth root of such a function in the unary notation) , and by Theorem [5] they 
are equivalent to Dehn functions of finitely presented groups. 

Theorem |2| allows us to formulate the following corollary of Theorem || which gives 
examples of subgroups of the direct product of two free groups with "arbitrary weird" 
distortion. 

Theorem 7 (Sapir) For every time function T(n) of a non- deterministic Turing machine 
with super additive T(n) 4 there exists a subgroup of F2 x F2 with distortion function T(n) 4 . 
In particular for every real number a > 4 computable in time < 2 2 ™ there exists a subgroup 
of i*2 x F2 with distortion function equivalent to n a . 

Recall that F2 x F2 is automatic. Notice that every cyclic subgroup of it has (obviously) 
bounded distortion. 



1.3 Length Functions of a Finitely Generated Group 

Theorems § and gives information about the set of distortion functions of subgroups of 
one particular group, F2 x F%- In this section, we shall fix an arbitrary finitely generated 
group G and describe all possible length functions (and hence distortion functions) of G 
inside other groups. 

A complete description of all length functions of a finitely generated group is given by 
the following theorem. 



Theorem 8 ( Olshanskii, j^q/) Let £ : G — > N be a length function on a group G. Then the 
following conditions hold: 

(Dl) 1(g) = £(g _1 ) for every g G G; £(g) = if and only if g = 1. 
(D2) £(gh) < 1(g) + 1(h) for every g,heG. 

(D3) There exists a positive number c such that the cardinality of the set {g G G \ £(g) < r} 
does not exceed c r for every r G N. 

Conversely for every group G and every function £ : G — > N satisfying (Dl) - (D3), there 
exists an embedding of G into a 2-generated group H with generating set B = {61,62} such 
that the length function g — > \g\s is equivalent to f . 

In the particular case when G is a cyclic group, Theorem |8| implies that for any number 
a G (0, 1] , there exists a group H a > G and an element g G H a such that the length of g l 
in H a grows as i a . This gives an answer to Gromov's question [|18|. 



Another problem by Gromov [18] asked for a description of length functions of cyclic 
groups in finitely presented groups. It is clear that not every function satisfying (D1)-(D3) 
can be a length function of the cyclic group in a finitely presented group: the cardinality of 
the set of O-equivalence classes of functions satisfying (D1)-(D3) is continuum, and the set 
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of embeddings of the infinite cyclic group into finitely presented groups is countable. Nev- 
ertheless the following theorem shows that all "reasonable" functions are length functions 
of a given finitely generated group G in a finitely presented group. 

Let G be a group with a finite generating set A = {ai, . . . , a m }. Let F m be the free 
group generated by A U A^ 1 . Every function £ : G — > N can be naturally extended to a 
function £* : F m — > N. We say that £ is computable if ^* is computable in the natural sense. 

Theorem 9 (Olshanskii, Jjjlj/J Le£ £ be a computable function G — ► N satisfying (Dl)- 
(D3). Then G can be embedded into a finitely presented group H in such a way that the 
corresponding length function is equivalent to I. 

This theorem immediately follows from Theorem || and the following result. 

Theorem 10 (Olshanskii, j^j) Every finitely generated and recursively presented group G 
can be quasi-isometrically embedded into a finitely presented group. 

Although Theorem |9| shows that all "reasonable" functions are length functions of a 
given finitely generated recursively presented group inside finitely presented groups, it does 
not give a characterization of these functions. Such a characterization has been found 
recently by Olshanskii. This answers questions asked by P. Papasoglu and R. Gilman. It 
also gives a complete solution of Gromov's problem from fl8|| . It turned out that such a 
characterization can be easily deduced from and [p9j] . 

We say that a function I : G — > N satisfies condition (D4) if there exists a natural 
number n and a recursively enumerable set S C F m x F n such that 

(a) if (vi,u),(v2,u) £ S for some words v\,V2,u then v\ and V2 represent the same 
element in G; 

(b) £*(v) = min({|w| | (v,u) G S}) for every v G F m . 

Clearly it does not depend on the choice of generators of G whether £ satisfies condition 
(D4) or not because of the obvious rewriting. 

Notice that in (D4), we can always assume n = 2. Indeed, if condition (D4) holds for a 
function £ and some n, it also holds for £ and any natural number n' > 2 since there is an 
isomorphic embedding of F n into F n > . 

Theorem 11 ( Olshanskii, Let G be a finitely generated subgroup of a finitely presented 
group H. Then the corresponding length function on G satisfies conditions (Dl)-(D^.). 
Conversely, for every finitely generated group G and every function £ : G — > N satisfying 
conditions (D1)-(D4), there exists an embedding of G into a finitely presented group H such 
that the length function g i— > \g\jj is O -equivalent to £. 

Condition (D4) is relatively complicated. We do not know if it is possible to simplify 
it in general. But in the case when the group G has solvable word problem, including 
the important case when G is cyclic, condition (D4) can be replaced by a much simpler 
condition. 

As usual, the graph of a function £* : F m — > N is the set (w,£*(w)) C F m x N. A pair 
(w, k) is said to lie above the graph of £* if £*(w) < k. 
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Theorem 12 (Sapir, ply) Let G be a finitely generated group with decidable word problem. 
Then the function £ : g i— > \g\jj given by an embedding of G into a finitely presented group 
H satisfies condtions (D1)-(D3) and the following condition 

(D4') The set of pairs above the graph of I* is recursively enumerable. 

Conversely, for every function £ : G — > N satisfying conditions (Dl), (D2), (D3), 
and (D4'), there exists an embedding of G into a finitely presented group H such that the 
corresponding length function on G is O-equivalent to £. 

It is again clear that whether condition (D4') holds or not does not depend of the choice 
of generators of G. 

In the important particular case when G is the infinite cyclic group we have 

Corollary 1 (1 )Let g be an element of infinite order in a finitely presented group H with 
a generating set B = {b\, ...,&&}. Denote £{i) = \g l \i3 = |<7 l | for £ € Z. Then 

(CI) £(i) = £(—i) for i £ Z (I is symmetric), and £{i) = iff i = 0; 

(C2) £(i + j) < £{i) + £{j) for i, j E Z (I is subadditive); 

(C3) there is a positive number c such that card{i € Z|£(i) < r} < c r for any r € N. 

(C4) the set of integer pairs above the graph of £ is recursively enumerable. 

(2) Conversely, for any function £ : Z — > N, satisfying the conditions (C1)-(C4), there is 
a finitely presented group H and an element g 6 H such that \g l \ii is O-equivalent to £(i). 

It is easy to prove that (D4) implies (D4'). Indeed, suppose that (D4) holds. Consider 
a Turing machine M listing elements of the recursively enumerable set E. Let us change 
the machine M in such a way that (1) instead of pairs (w, u) from F m x F n it produces 
pairs (w, \u\) from F m x N and (2) after every, say, 10, steps of calculation, it goes through 
all pairs listed so far and for each of these pairs (wi, ki) adds a pair (wi, hi + 1) to the list, 
then it does the next 10 steps of calculations, etc. Clearly, this new machine will list all 
pairs which are above the graph of £* and only these pairs. Thus the set of pairs above the 
graph of £* is recursively enumerable and condition (D4') holds. 

By the proper choice of a universal group H it is not difficult to sharpen the formulation 
of Theorems |9| and 11. One can select the group H in these theorems (independently of G) 



as the receptacle of all possible "computable distortions" of finitely generated recursively 



presented groups. The next theorem follows from Theorem 4 from [29]. 



Theorem 13 (Olshanskii, 1998) There exists a finitely presented group H, having the fol- 
lowing property. For an arbitrary finitely generated recursively presented group G and an 
arbitrary function £ : R — > N satisfying conditions (D1)-(D4) there exists an embedding of 
G into H such that the length function of G corresponding to this embedding is O-equivalent 
to £. 
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1.4 Groups with Word Problem in NP 

The well known Higman theorem says that a group has a recursive presentation if and only 
if this group is embeddable into a finitely presented group. Theorem |l0| strengthens this 
result. The next Theorem strengthens it even further. 

Theorem 14 (Birget, Olshanskii, Rips, Sapir ffij) Let G be a finitely generated group with 
word problem solvable by a non-deterministic Turing machine with time function < T(n) 
such thatT(n)^ is superadditive. Then G can be quasi-isometrically embedded into a finitely 
presented group H with isoperimetric function equivalent to n 2 T(n 2 ) 4 . In particular, the 
word problem of a finitely generated group is in NP if and only if this group is a (quasi- 
isometric) subgroup of a finitely presented group with polynomial isoperimetric function. 



In particular, this theorem gives a Higman- like description of groups with word problem 
in NP. 

The class of finitely generated groups with word problem in NP is very large. It clearly 
includes all matrix groups over Q. It also includes 

• All finitely generated matrix groups over arbitrary fields: this follows from the fact 
that every finitely generated field is a finite extension of a purely transcendental 
extension of its simple subfield, and the fact that the word problem in the ring of 
polynomials over Q or Z/pZ is solvable in polynomial time, 

• Polycyclic and finitely generated metabelian groups because they are representable 
by matrices p3 , 



• Automatic groups (in particular, hyperbolic groups) JT0|, 

• Groups of piecewise linear transformations of a line with finitely many rational sin- 
gularities (including the R. Thompson group F) || , 

• Every finitely generated subgroup of a diagram group [19], 

• Every free Burnside group B(m,n) for sufficiently large odd exponent n (see, for 
example, Storozhev's argument in Section 28 of pq]). 

This class is closed under free and direct products. It is easy to see using Magnus' 
embedding that for every normal subgroup A of a free finitely generated group F if F/N 
has word problem in NP (resp. P) then F/N' has word problem in NP (resp. P). Therefore 
every free group in the variety of all solvable groups of a given class has word problem in P. 

It is an interesting question whether this class also contains all one-relator groups. There 
are of course finitely generated groups with word problem not in NP, for example groups 



with undecidable word problem. Moreover the construction from |33|] allows one to construct 



12 



groups with decidable but arbitrary hard word problem. But these groups are in some sense 
"artificial". So perhaps the class of groups with word problem in NP (which by Theorem 



14 is the class of all subgroups of finitely presented groups with polynomial Dehn functions) 



can be considered as the class of "tame" groups. 

An example of an embedding of one group into another where lengths are not distorted 
but areas are distorted can be found in Gersten |jT4f |. Some examples of groups with big Dehn 
functions embeddable into groups with small Dehn functions can be found in Madlener, Otto 
p4| and Baumslag, Bridson, Miller and Short 0]. Our results show that any recursively 
presented finitely generated group can be embedded into a finitely presented group with 
bounded length distortion but with close to maximal possible area distortion. Indeed, 
Theorem || shows that an isoperimetric function of a group H containing a given group G 
cannot be smaller than the time complexity T(n) of the word problem for G, and Theorem 



14 shows that G can be embedded into a finitely presented group with Dehn function at 
most n 2 T(n 2 ) 4 (which is polynomially equivalent to T(n) ). 

For matrix groups our theorem implies that every such group is embedded quasi- 
isometrically into a finitely presented group with Dehn function at most n 10+e for every 
e > 0. It is interesting to know the smallest Dehn function of a finitely generated group 
containing, for example, the Baumslag-Solitar group -BS^i- 



Notice that a semigroup analog of Theorem |14J was obtained in Birget |6| 



As it usually happens, solution of one problem leads to solutions of other problems. 
In 1976, D. Collins asked [21] if there exists a version of the Higman embedding theorem 



which preserves the degree of unsolvability of the conjugacy problem. The answer is "yes" 
as the following theorem shows. 

Theorem 15 (Olshanskii, Sapir, 1998) The embedding described in the proof of Theorem 



14 preserves the degree of unsolvability of the conjugacy problem. In particular, the conju- 



gacy problem is decidable in G if and only if it is decidable in H. 

Using the proof of Theorem [14|, in order to embed a finitely generated group G with word 
problem in NP into a finitely presented group with polynomial isoperimetric function, one 
needs first construct a Turing machine which solves the word problem in G, then convert 
it into a so called S'-machine (see below), then convert the S-machine into a group. As a 
result the group we construct will have a relatively complicated set of relations. In some 
important cases like the Baumslag-Solitar group G^i, the free Burnside groups B(m,n), 
where n is odd and >> 1, and others, we can modify our construction and get simple 
presentations of groups with polynomial isoperimetric functions where these groups embed. 

Consider, for example, the free Burnside group B{m,n) with m generators {01, ...,a m } 
and exponent n. This group is very complicated and in particular not finitely presented if 
m > 2 and n is odd and > 665 (Adian, Q). Now we are going to give a presentation of a 
finitely presented group H with a polynomial isoperimetric function, containing B(m,n) as 
a quasi- isometric subgroup. 
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The relations of B(m,n) have the form u n = 1 where u is an arbitrary word in the 
alphabet of generators. So our goal is to find a finite set of relations of a bigger group which 
will imply all the relations u n = 1 (and no extra relations between generators of B(m,n)). 

Instead of first writing relations of H, and then drawing van Kampen diagrams we shall 
first draw diagrams, and then write relations. 

For simplicity take n = 3. The construction really does not depend much on n, so we 
shall sometimes write n instead of 3. First of all, we shall find a finite set of relations which 
imply relations of the form 

K(uqiuq 2 uq 3 ) = k 1 (uq 1 uq2uq 3 )k 2 (uq 1 uq 2 uq 3 y k 3 ....k N (uq 1 uq 2 uq 3 ) ( - N * ) 

for every word u in the alphabet {ai, a m }. Here N is a sufficiently large number (28 
is enough), k±, k^, q±, q 2 , q 3 are new letters, and the words between consecutive k's are 
copies of uq\uq 2 uq 3 written in disjoint alphabets. The group given by these relations will 
be denoted by G m ^ n . Figure 3 shows the van Kampen diagram (below it will be called a 
disc) with boundary label K (uq\uq 2 uq 3 ) . 

uq 1 uq 2 uq 3 




Fig. 3. 

On the boundary of this diagram we have the word K (uq\uq 2 uq 3 ) . The words on each 
of the concentric circles is labeled by K {uiq\Uiq 2 Uiq 3 ) where Ui is a prefix of u of length 
i — 1. The word written on the innermost circle is K{q\q 2 q 3 ). This word will be called the 
hub. The edges connecting the circles are labeled by letters r±, ...,r m . The cells tessellating 
the space between the circles have labels 

• q? = ajqi, i = l,2, 3, j = 1, m. 

• ar = ra, a G {ai, ...,a m }, r G {n, ...,r m } 
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• kr = rk, k G {fci, k N }, r G {n, ...,r m }. 

plus iV copies of each of these relations written in iV disjoint alphabets. These relations 
plus the hub relation if (ft 9293) form the presentation of G m>n . 

Now we construct H = H m ,n- Take a copy of B(m,n) generated by {b±, ...,b m }. The 
group H will be an HNN-extension of the direct product B(m,n) x G m:Jl . Here is the van 
Kampen diagram: 



uqiuq 2 uq 3 




Fig. 4. 

This is an annular diagram. The hole of it has label v% (ub is the word u rewritten in 
the alphabet {61, b m }). The boundary label of the disc is K \uq1uq2uqs) , the label of the 
external boundary of the diagram is also K \uq1uq2uq3) . In order to fill this diagram as 
shown on the picture, one needs a new (stable) letter p and the following relations: 

• pk = kp for k G {ki, kj^}- 

• PQ = <1P for every q G {gi, q 2 , 93, -, Q^}- 

• pa = ap for every a from the N copies of {ai, a m }. 

• a p = ab for every a G {a±, ...,a m }. 

• ab = ba for every a G {a±, a m } and b G {61, b m }. 

• qib = bqi for every b G {61, ...,6 m }, i = 1,2,3. 

Since the label of the external boundary is K \uq1uq2uq3) , we can glue in a disc with 
this label, and make our annular diagram into an ordinary diagram with boundary label 
u^. Since the discs are filled with cells corresponding to the relations of G m ^ n , and the rest 
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is filled with cells corresponding to the new relations, we get that all defining relations of 
B(m,n) follow from the (finitely many) relations that we got. The group H that we just 
created is what we need. 

Theorem 16 (Olshanskii, Sapir, 1998) The natural homomorphism of B(m,n) into H is 
a quasi-isometric embedding. The group H has isoperimetric function n 8+e provided n is 
odd and > 10 10 ; linv^oo e = 0. 

Similarly we can quasi-isometrically embed a relatively free group G of any finitely 
based group variety into a finitely presented group. The resulting group will have a polyno- 
mial isoperimetric function provided G has polynomial verbal isoperimetric function. This 
function is defined as follows: 

Let v(x\, x n ) be a word. Suppose that w is in the verbal subgroup vsg(u). 
Then w = Yli v(Xij, ...,X n> i). Fix such a representation of w with minimal sum 
of lengths of all \Xj{\ involved in this representation. The verbal isoperimetric 
function gives an upper bound for this sum in terms of \w\. 

This function does not depend (up to "big O") on the identity defining the variety, so 
one can speak about verbal isoperimetric functions of varieties. 

For example, the variety of solvable groups has polynomial verbal isoperimetric function, 
so our construction embeds it into a group with polynomial Dehn function. 

The variety of Burnside groups of odd exponent n >> 1 has verbal isoperimetric function 
n 1+e (linin^oo e = 0). This can be proven by modifying Storozhev's argument from 
(Storozhev's argument gives estimate n 4 for the verbal isoperimetric function). 

We can also embed in a similar way the Baumslag-Solitar groups G\ tn into finitely 
presented groups with isoperimetric function n 10 . 

2 Methods 

2.1 S'-machines 

First of all let us present some ideas how to find a group with an "arbitrary" Dehn function. 
Consider again the main diagram called a disc on the Figure 3 for the group G mjTl . 

The disc is divided by the A;-bands into N sectors. The words written on the circles 
between consecutive k's have the form 

uqiuq 2 uq 3 

and to pass from one level to another level we replace qi by aq^. So we can imagine these 
words written on a tape of a Turing machine, qi mark the places where the heads are, and 
we have a rule of the form 

[qi -> aq 1 ,q 2 aq 2 , q 3 -> aq 3 ] 
16 



for every a. 

What we get is a simple example of a so called S-machine. 

Roughly speaking, the difference between S-machines and ordinary Turing machines is 
that S-machines are almost "blind" . They "see" letters written on the tape only when these 
letters are between two heads of the machine and the heads are very close to each other. 
If the heads are far apart, the machine does not see any letters on the tape, in this case a 
command executed by the machine depends only on the state of the heads. 

In contrast, ordinary Turing machines can see letters on the tape near the position 
where the head is. The command executed by the machine always depends not only on the 
state of the head but (which is very important!) also on the letter(s) observed by the head. 
Notice that even for moving the head a Turing machine one square to the left, one needs to 
know the content of the square to the left of the head. 

Let us give a precise definition of S'-machines. Let A; be a natural number. Consider 
now a language of admissible words. It consists of words of the form 



qiuiq2--.u k qk+i 

where q$ are letters from disjoint sets Qi, i = 1, k + 1, Uj are reduced group words in an 
alphabet Yi {Yi are not necessarily disjoint), the sets Y = {jY. L and Q = \JQi are disjoint. 

Notice that in every admissible word, there is exactly one representative of each Qi and 
these representatives appear in this word in the order of the indices of Qi. 

If < i < j < k and W = qiU\q2-..Ukqk+i is an admissible word then the subword 
qiUi...qj of W is called the (Qi, Qj)-subword of W (i < j). 



An S-machine is a rewriting system [22|. The objects of this rewriting system are all 
admissible words. 

The rewriting rules, or S-rules, have the following form: 

where the following conditions hold: 

Each Ui is a subword of an admissible word starting with a Q^-letter and ending with a 
Q r -letter (where I = £(i) must not exceed r = r(i), of course). 

If i < j then r(i) < £(j). 

Each Vi is also a subword of an admissible word whose Q-letters belong to Qeu) U ••• UQ r w 
and which contains a Q^-letter and a Qr-letter. 

If £(1) = 1 then V\ must start with a Qi-letter and if r(m) = k + 1 then V n must end with 
a Qfe+i-letter (so tape letters are not inserted to the left of Qi-letters and to the right 
of Qfc+i-letters). 

To apply an S-rule to a word W means to replace simultaneously subwords Ui by 
subwords Vi, i = 1, ...,m. In particular, this means that our rule is not applicable if one of 
the C/j's is not a subword of W. The following convention is important: 
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After every application of a rewriting rule, the word is automatically reduced. We do 
not consider reducing of an admissible word a separate step of an S-machine. 

We also always assume that an S-machine is symmetric, that is for every rule of the 
S-machine the inverse rule (defined in the natural way) is also a rule of this S-machine. 
This reflects the fact that the r-edges in the disc on Figure 3 can point away from the hub 
or toward the hub. 

Notice that virtually any S'-machine is highly nondeterministic. 

Among all admissible words of an S'-machine we fix one word Wq. If an S-machine S 
can take an admissible word W to Wo then we say that S accepts W. We can define a 
time and space function of an S-machine as usual. If U — > U\ — > ... — ► U n = Wo is an 
accepting computation of the S-machine S then \U\ + \U\ \ + ... + \U n \ is called the area of 
this computation. This allows us to define the area function of an S-machine. 

Theorem 17 (Sapir, ftSSjl) S -machines are polynomially equivalent to Turing machines. 
More precisely for every Turing machine M with time function T(n) there exists an S- 
machine with area function T(n) 4 which is equivalent to M (this means that there exists a 
correspondence 4> between configurations of M and admissible words of S, given a configu- 
ration c, the word 4>(c) is computable in linear time, and the machine M accepts c if and 
only if S accepts 4>(c)). 

In fact a stronger theorem can be deduced from the main results of |3^]. It was recently 
proved by Sapir. 

Theorem 18 (Sapir, 1998) For every Turing machine M with time function < T(n) such 
that T(n) 4 is superadditive, there exists an S-machine S with one head and only one internal 
state which is equivalent to M and has time function < T{n) . 

Notice that an S-machine with one head and one state letter is completely blind (in 
the sense explained above). The rules of such an S-machine have the following very simple 
form: 



[q -» uqv] 

where q is the internal state, u and v are words in the tape alphabet. 

The amazing fact is that the proof of a completely Computer Science statement, Theo- 
rem 18, involves some heavy geometric group theory. We first convert M into an S-machine 
Si with many heads, then convert Si into the group from [^] with Dehn function T(n) 4 , 
then convert the group into an S-machine S with one head and one internal state, having 
time function T(ra) 4 (in the last step we use an idea from Miller p5| ). 

The group Gn(S) associated with an S-machine S is constructed in the same way as 
the group G mtn presented above. We add all rules of S to the set of generators and for 
every rule r of the form \U\ — > Vi, U p — ► V p ] we have p relations U[ = V\, Up = V p . 
These relations replace the relations q\ = aqi in the presentation of G m ^ n . Other relations 
are the same. 
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Although this construction slightly differs from the construction in 
prove the following statement. 

Theorem 19 (Sapir, ffiSj]) Every Turing machine M with time function T(n) can be con- 
verted into an S -machine S in such a way that the Dehn function of the group Gn(S) is 
T(n) 4 provided T(n) 4 is superadditive. 

Now in order to embed a finitely generated group G into a finitely presented group we 
take a Turing machine M recognizing words which are equal to 1 in G, convert it into 
an S-machine S, and then basically repeat the construction of the group H m , n replacing 
B(m,n) by G and G m ^ n by Gjv(«S). The resulting group is denoted by Hn(S). It plays the 
role of group H in Theorem |14[ 



33 1 it is possible to 



2.2 Why S- machines? 

Here we will explain why we need to convert Turing machines into 5-machines. 

Consider any Turing machine M. For simplicity assume that M has one tape, which 
is always finite, but we can add squares at the right end of the tape, the alphabet A of 
tape letters, the set Q of states, and the set R of transitions. As usual we assume that 
the head is always placed between two squares of the tape, and observes both squares. So 
the transitions have the form uqv — > u'q'v' where u, v, u' , v' are words in the tape alphabet, 
q,q' € Q (see [32] for details). Then using the same idea as in the construction of G mtn we 



can replace the relations q\ = aqi by {uqv) r = u'q'v' (here r is a letter associated with the 
transition of M) . As in [ 32 we assume that M has only one accept configuration Wq . 



Thus we have the following presentation of the group G(M) associated with M. 

• (uqv) r = u'q'v' , for every transition r = [uqv — > u'q'v'] of the machine M, 

• ar = ra, for every a £ A and every transition r 

• kr = rk, for every k € {hi, kjy} and every transition r. 

As before, we need copies of each of these relations written in disjoint alphabets. The 
hub relation will have the form K(Wq) where Wo is the accept configuration. 

Now it is easy to see that for every accepted word U we can tessellate the disc with 
boundary label K(U) into cells labeled by these relations. Let U = U\ — > JJ% — > ... — > U p = 
Wo be an accepting computation. As before we will have a sequence of concentric circles, 
each labeled by K(Ui), the innermost oval will be labeled by K(Wo). 

So it is easy to see that the word K(U) is equal to 1 in G(M) if the configuration U is 
accepted by M. 

Unfortunately the converse statement is wrong in most cases and this is precisely why 
we need S-machines. Let us demonstrate this on a simple example. Consider the following 
Turing machine M. It has two states q, qo and one tape letter a. The only transitions are 
the following: 
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(ri) aq -> g , 
(r 2 ) ago -> 9o- 

The stop configuration is go (the tape is empty). It is clear that the set of configurations 
accepted by this machine consists of configurations a n go and a m q where n > 0, m > 0, and 
does not include, for example, the configuration q. Thus we would like K(q) to be not equal 
to 1 in G(M). The diagram on Figure 5 shows that K(q) is equal to 1 in this group. 



This picture shows the tessellation of only one of the sectors. The other sectors are 
tessellated in the same way. 

One can easily see the difference between this picture and the standard picture of a disc. 
Here we have pairs of cells which have two common edges, and in the standard disc cells 
could have at most one common edge. 

The diagram on Figure 6 is a subdiagram of the diagram on Fig. 5. It consists of 
two cells corresponding to the relations r 2 a = ar 2 and (ago)** 2 = qo and has boundary label 
corresponding to the relation {qoY 2 = a~ 1 qo which is the relation corresponding to the rule 
[qo — > a~ 1 qo], the inverse rule for [go — ► ago]- 




Fig. 5. 



a qo 



It is possible to prove that the group G(M) actually simulates the S-machine with the 
set of admissible words a n q, a n qo and the set of rules 



q 

g -»• a 



a 1 qo, 



plus the inverse rules. This S-machine is "stronger" than M, it accepts more configurations, 
including the configuration q. 

In general if we take any Turing machine M and repeat this construction we will get a 
group simulating the S'-machine obtained by replacing every transition uqv — > u'q'v' by the 
S-rule [q — > u u'q'v' v ]. This S'-machine will almost always be much stronger than the 
original Turing machine. 

One way around this problem was invented by Boone and Novikov |}2]]. This is why 
they used the Baumslag-Solitar type relations x a = x 2 . These relations prevent appearance 
of negative letters on the "tape" (the concentric circles in the disc). But we could not use 
these relations because they make the Dehn function exponential. 

Thus we had to prove instead that S-machines are polynomially equivalent to ordinary 
Turing machines (Theorem |T?|), 



2.3 Geometry of van Kampen Diagrams 

In order to analyze an arbitrary diagram over H = H mjn , and Hn(S) in general we change 
the presentation of H. We add all words K(u) (discs) and all relations of G = (B) to the 
presentation. The presentation becomes infinite. After that we order the relations, saying 
that the discs have the highest rank, r-relations have smaller ranks, and the 6-commutativity 
relations have the lowest rank. With every diagram we associate its type, a vector, the first 
coordinate of which is the number of discs, and the last coordinate is the number of b- 
commutativity cells (we omit the ranking of other relations). It turns out that diagrams of 
minimal type have nice geometric properties. 

The main and easy concept which helps us analyze these diagrams is the concept of 
a band 0. If S is a set of letters then an S-band is a sequence of cells 7Ti, ...,7r„ in a van 
Kampen diagram such that each two consecutive cells in this sequence have a common edge 
labeled by a letter from S. Figure 7 illustrates this concept. 
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Fig. 7. 



2 Other people call them corridors and strips. 
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The broken line formed of the intervals £(7Tj,ej), £(7Tj,ej_i) is called the median of this 
band. 

We say that two bands cross if their medians cross. We say that a band is an annulus 
if its median is a closed curve. In this case the first and the last cells of the band coincide 
(see Figure 8) 



Let S and T be two disjoint sets of letters, let (it, tt\, . . . , ir n , -it') be an S"-band and let 
(-7T, 71, . . . , 7 m , 7r') is a T-band. Suppose that: 

• the medians of these bands form a simple closed curve, 

• on the boundary of it and on the boundary of it' the pairs of S-edges separate the 
pairs of T-edges, 

• the start and end edges of these bands are not contained in the region bounded by 
the medians of the bands. 

Then we say that these bands form an (S, T) -annulus and the curve formed by the medians 
of these bands is the median of this annulus. 

For example, the diagram on Figure 3 contains fc-bands, g^-bands, ^4-bands crossing 
the circles transversally, and r-annuli filling the space between consecutive circles. In the 
diagram on Figure 4 we also have a p-annulus going around the disc, and many 6-bands 
consisting of the 6-commutativity cells. 

The main idea is the following. In most relations of the presentation of H one can 
choose two pairs of letters which belong to disjoint sets of letters. For example, the relation 
aba^ 1 ^ 1 = 1 has a pair of a-letters and a pair of S-letters. The cells corresponding to 
these relations must form a-bands and fe-bands in a van Kampen diagram. Each cell is an 
intersection of an a-band and a 6-band. Thus if we prove that the number of a-bands is 
"small" and the number of 6-bands is "small" , and that an a-band and a 6-band can have 
at most one common cell, we show that the number of (a, 6)-commutativity cells is "small". 

In order to bound the number of bands we use the following idea. Suppose that we have 
ruled out annuli. Then every band starts (ends) either on the boundary of the diagram (the 
number of such bands is linear in terms on the perimeter), or on the boundary of a cells (for 
example, an a-band can end on a disc). This gives us the direction in which to proceed. 




a 



b 



Fig. 8. 
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First we assume that a diagram contains no discs and prove the absence of certain types 
of annuli: r-annuli, p-annuli, a-annuli, (r, a)-annuli, etc. (22 different kinds altogether). 
One way to prove it is to use a simultaneous induction: assume that one of these annuli 
exists, take the innermost annulus of one of these kinds. Then the subdiagram bounded by 
this annulus does not contain annuli of any of the 22 kinds. This makes the subdiagram 
look nice and eventually leads to existence of a pair of cells that cancel (thus the diagram 
is not reduced which contradicts its minimality). 

Then we assume that the diagram contains discs and we bound the number of discs (see 
below) and their perimeters. Then we bound the number of r-bands by proving that there 
are no r-annuli, so each of the r-bands must start and end on the boundary of the diagram. 
Similarly we bound the number of p-bands. Then we bound the number of g-bands (they 
can start on the discs, and the perimeters of the discs are already bounded). Since every 
g-cell is an intersection of an r-band or a p-band and a g-band, we bound the number of 
(/-cells. This leads to a bound of the number of A-bands (they can end on g-cells and on 
discs), and so on. 

Of course we always need the absence of multiple intersections of bands. Although the 
next Figure 9 shows that a multiple intersection of an S*-band and a T-band does not 
necessarily produce an (S, T)-annulus, it turns out to be enough to rule out (S, T)-annuli. 
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Fig. 9. 



In order to bound the number of discs (and their perimeters) in a van Kampen diagram, 
we use the following idea. 

The generic diagram over the presentation of H looks like this: 
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Fig. 10. 



Discs in the diagram are connected by /c-bands. 

So with every van Kampen diagram we can associate a graph of discs. The vertices of 
this graph are the discs plus one external vertex. Vertices are connected by the fc-bands. 
If a fc-band starts on a disc and ends on the boundary of the diagram, we assume that 
this band terminates in the external vertex. The degree of each internal vertex of this 
graph is N >> 1. We prove that this graph cannot have bigons: two discs connected by 
a pair of fc-bands. This implies that the graph of discs is hyperbolic, and a standard small 
cancellation theory applies [23|. In particular there exists a disc with N — 3 external edges. 
This also implies that the number of discs and /c-bands in the diagram is linear in terms of 
the perimeter. 

In order to rule out r-annuli, p-annuli and other types of annuli, we use several type 
reducing surgeries on a diagram. One of them is illustrated by the following picture. 

Moving r-bands. Suppose that in a minimal diagram A an r-band TZ touches a disc II 
as in Figure 11 (that is one of the sides of TZ has two common /c-edges with the contour of 
the disc). Then it can be proved that the bottom path of TZ has a common subpath with the 
contour of II starting and ending with fc-edges. Let p be the maximal common subpath with 
this property, so that bot(i?) = qpq', d(U) = pp\. Without loss of generality we can assume 
that the label Lab(p) of the path p has the form kiwki+iw'ki^.-.kj where w = uq\uq2uq^ 
Then for some word V we have that Lab(p)V is a cyclic shift of K(w) = Lab((9(II)). One can 
construct an r-band TZ' with the bottom path labeled by the word V and the r-edges having 
the same labels as in TZ. Let TZ" be the subband of TZ with bottom path p, so TZ = TZ\TZ"TZi. 
Let e be the start edge of TZ" and let e' be the end edge of TZ". Cut the diagram A along 
the path e pie'. We can fill the resulting hole by gluing in the r-band TZ' and the mirror 
image TZ of ft'. The new diagram A' that we obtain this way will have two r-bands instead 
of the old r-band TZ. The first is TZ\(JZ ) _1 7?-2 (the inverse band (TZ )~ 1 differs from TZ by 
the order of cells) and the second one is TZ"TZ' . The second r-band is an annulus which 
touches II along its inner boundary. If we replace the disc II by the corresponding van 
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Kampen diagram over the presentation of Gjy(<S), we see that the subdiagram II' bounded 
by the outer boundary of the annulus TZ"TZ' is a diagram over the presentation of Gn(S) 
with exactly one hub and no r-edges on the boundary. Then one can prove that II' is a disc 
(corresponding to some computation). We replace it by one cell of the infinite presentation 
of H. Then we reduce the resulting diagram. 
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Fig. 12. 



Suppose that there are discs inside the region bounded by this annulus. Then these discs 
form a hyperbolic graph, and so the r-band will intersect more than 1/2 of the fc-bands going 
out of one of these discs. Then the r-band moving construction reduces the type of the 
diagram. Thus the region bounded by the r-annulus cannot contain discs. But we have 
ruled out the case when a diagram without discs contains an r-annulus, a contradiction. 

In order to bound the perimeters of discs and B-cells we use the following idea. The 
contour of a disc contains a constant number of non A-edges. Thus in order to bound the 
perimeter of a disc, we need to bound the number of A-edges on the contour of it. Every 
^4-edge on the contour of a discs is the start edge of an a-band. An a-band consists of 
a-commutativity cells corresponding to relations of the form ab = ba, ar = ra or ap = pa 
or to the relations of the form a p = ab. Thus an a-band can end either on a disc or on 
the boundary of a (a, g, r)-cell. The latter belongs to an r-band and we already know that 
the diagram contains only "small number" of r-bands. Thus if the a disc has a very big 
perimeter and many of the a-bands starting on the contour of this disc end on boundaries 
of (a, q, r)-cells, then many of these a-bands must end on the contour of the same r-band. 
The following lemma shows that it is impossible. 

Lemma 2 Let TZ\,...,1Z n be maximal a-bands starting on a path p where p is an A-subpath 
of the boundary of a disc II. Suppose that the end edges of all TZi are on the contours of 
r- cells belonging to the same r-band T . Then n < 2. 

Sketch of the Proof. Indeed, if n > 2 then there are three a-bands, say, 7Z±, 1Z%, IZ3 
starting on p and ending on three different cells 7Ti, tt2 and ir^ of T. We can assume that 
7T2 is between m and -n^ (see Figure 13). Consider the minimal subdiagram Ai of our 
diagram containing a-bands TZi , IZ2 , TZs , the minimal subpath of the path p containing the 
starting edges of 1Z±, IZ2, TZ^, and the part of the band T between it\ and 1x3 Then Ai has 
no fc-edges on its contour. Therefore Ai does not contain discs. Therefore the maximal 
g-band Q in Ai containing 712 divides A2 into two parts (that is if we delete the g-edges 
from Q, the diagram A2 will fall into two pieces). The subpath of the path p containing 
the start edges of TZi , IZ2 , 72-3 is contained in one of these parts since it does not contain 
Q-edges. The cells tt\ and 7r3 belong to different parts because Q cannot intersect T twice. 
Since the 7Ti and are connected with the cells on p by a-bands, one of these bands must 
intersect Q which is impossible (a g-band cannot cross an a-band). □ 
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Fig. 13. 



Finally we need to estimate the number and perimeters of 5-cells (i.e. cells correspond- 
ing to relations of the group {B)). Here we use the following trick. Suppose that two 
-B-cells are connected by a 6-band consisting of (a, 6)-commutativity cell. Then we can cut 
the two £>-cells together with the 6-band from the diagram, and replace it by one .B-cell and 
a number of (a, 6)-commutativity relations. This reduces the type of the diagram because 
the commutativity relations have smaller rank than i?-relations. Figure 14 shows how this 
surgery proceeds. 




Fig. 14. 

This implies that every 6-band starting on the contour of a B-ce\l must end either on 
the boundary of the diagram or on the contour of a (a, p, b)-ce\l. The number of maximal 
a-bands in the diagram is bounded (because the total perimeter of the discs is bounded, 
and the number of (/-cells is bounded too) , and a lemma similar to Lemma ^ shows that the 
number of 6-bands starting on the contour of the same 5-cell and ending on the contour 
of the same a-band is at most 2. This leads to the bound of the number of 5-cells and the 
total perimeter of 5-cells. 

Finally we can estimate the areas of words in H relative to the finite presentation of H. 
Take any word w which is equal to 1 in H. Then there exists a diagram over the infinite 
presentation of H (with discs and -B-cells) with boundary label w. The total perimeter of 
discs and £>-cells is bounded by a polynomial in \w\. Now replace every disc by the van 
Kampen diagram over the finite presentation of H (as in Fig. 3), and replace each 5-ceil 
by the diagram on Fig. 4 consisting of two discs and a relatively small number of other 
cells. The resulting diagram will be a van Kampen diagram over the finite presentation of 
H. It is easy to see that if the perimeter of a disc is p then the area is 0(T(p) 2 ) where 
T is the time function of the S'-machine. This gives an estimate of the area of w which is 
polynomially equivalent to T(|id|). 
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2.4 Why Is There No Distortion? 

The proof that the embedding of B(m, n) into H m ^ n and in general any recursively presented 
group G into H^(S) is undistorted also uses bands and annuli, and the structure of diagrams 
over the infinite presentation of H described in the previous section. 



Here we present the main points of the proof of bounded distortion in Theorems 1C , 14 



and 16 



For simplicity consider the case of the group H = B m ,n from Theorem [u]. The general 
case of Hn{S) is similar. By definition of bounded distortion, we have to find a constant 
c > such that for any element g € B(m,n) represented by a geodesic (in B(m,n)) word 
U = U (&i, . . . , b m ) in the alphabet B = {b\, . . . , b m } and for any word Z in the generators 
of H, that represents the same element, we have \Z\ > c\U\. 

In order to achieve this goal, consider the minimal diagram A over the infinite pre- 
sentation of H considered in the previous section, with boundary label UZ -1 . Then the 
boundary of A has the form p _1 p' where Lab(p) = U, Lab(p') = Z. We need to show that 
H < c\p'\ for some constant c. It suffices to make a correspondence between fo-edges of p 
and edges of p', such that any edge of p' corresponds to at most c _1 edges of p. 

First of all notice that we can assume that no B-cell in A has a common edge with 
p. Indeed, if such a B-cell exists, we can cut it off reducing the type of the diagram 
and replacing the path p with a not shorter path p\ (recall that U was a geodesic word 
representing g). 

Therefore for every edge e on p, there is a maximal 6-band T in A, starting at e. It can 
end neither on p nor on the boundary of a G^-cell (both cases are ruled out in the same 
manner as it was done in the previous section: we can do a type reducing surgery again). 

If T ends on the path p', we associate the terminal edge of T with e. Another possibility 
is that T terminates on the boundary of some maximal a-band (at the cell labeled by a 
relator p~ 1 aipb^ 1 a^' 1 ). A lemma similar to Lemma ^ shows that at most 2 maximal 6-bands 
starting on p can end on the boundary of the same a-band. This means that we can consider 
the set A of a-bands where these 6-bands end. 

The most pleasant (for us) among these a-bands are those which start or end on p' (they 
cannot end on p because p does not have ^4-edges). Other a-bands can terminate either on 
contours of r-bands or on disks. 

We need to consider two cases. In the first case the number of those r-bands is large 
(proportional to the number of a-bands in A). Since there are no r-annuli in A (see the 
previous section), each of these r-bands starts and ends on p', and we obtain a desired 
inequality \p'\ > c\p\. 

In the second case we have to assume that the number of r-bands where a-bands termi- 
nate is "small" . Since by a variation of Lemma || the number of a-bands terminating on the 
same r-band is bounded by a constant, in this case most a-bands in A terminate on discs. 

In this situation we use the so called ovals and their shadows (see |2S] and Q). 



An oval is a simple closed path h in the disk graph of the diagram A. It divides the 
plane into two regions. One of them, denoted by 0(h), must possess the following property. 
For every disk LT on h, the number n\ of maximal fc-bands going from II into 0(h) and 
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the number ni of the maximal fc-bands going from II into the exterior of 0(h) satisfy the 
following inequalities: 

Til > ri2 + 8, ri2 < 2n\. 

The hyperbolicity of the disk graph and high degrees of its interior vertices make possible 
drawing an oval h passing via any interior edge of the disc graph of A. 

One of the main properties of ovals is that if an r-band starts outside the subdiagram 
0(h) bounded by the oval and then intersects the oval, it cannot leave 0(h). Indeed 
otherwise the hyperbolicity of the disk graph would imply the existence of either a (k, r)- 
annulus (which is impossible, see the previous section) or an r-band intersecting too many 
maximal A;-bands starting on the same disk (again it is impossible because of the Moving 
r-bands construction from the previous section). 

Thus any maximal r-band crossing an oval h, must intersect its shadow, i.e. the bound- 
ary subpath of the diagram lying in 0(h). This allows us to prove that the shadow of any 
oval h is sufficiently long comparing to the perimeter of any disk II crossed by h. We can 
also choose h in such a way that the shadow of h is inside p' (because p does not contain 
fc-edges). Therefore the number of a-bands ending on the contour of a disc does not exceed 
the length of the shadow of any oval passing through the disc. If the bands A end on differ- 
ent discs IIi,..., Ilk then the hyperbolicity of the disc graph allows us to find ovals passing 
through these discs which have disjoint shadows, all inside p' . Thus the length of p' cannot 
be smaller than the number of a-bands in A, which in turn, as we know, cannot be much 
smaller than the length of p. 

The proof of the result that the shadow of an oval h is sufficiently long comparing to the 
perimeter of a disc II in h consists of two cases. In the first case the number of maximal r- 
band in 0(h) is sufficiently large (greater than, say, ^ of the number of all a-edges between 
successive /c-edges of II). This case is clear since all the r-bands must terminate on the 
shadow. 

The second case is complementary to the first one. Since the number of the r-bands is 
small, the quantity of the a-bands going from II into 0(p) and terminating on r-bands, is 
small too (Lemma Q works again). Therefore a majority of them terminates either in the 
shadow of II (this is the best alternative for us) or on some disks n',n",...nW inside of 
0(h). 

This situation can be analyzed by induction: as before we can draw ovals hi,... ,hf. 
passing through II', . . . , 11^ respectively, whose shadows are disjoint and are inside the 
shadow of h. 

2.5 Embeddings With Given Length Functions 

Here we present the main ideas of the proofs of results from |]28| (see Section [Qj| ) . 

If G is a subgroup of a group H with a finite set of generators B = {bi, . . . ,b n } then 
the function £(g) = \g\s on G evidently satisfies conditions (D1)-(D3) from Theorem ||. For 
instance condition (D3) holds because the number of all words of length at most k in the 
alphabet B grows exponentially as k —* oo. 
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To prove that every function G — > N satisfying conditions (D1)-(D3) can be realized as 
the length function of G inside a finitely generated group H , we start with a presentation 
G = Fq/N, where Fq is a free group with the basis {x g } g ec\{i} an d N is the kernel of 
homomorphism e : x g i— > g. 

Then we construct an embedding : x g — > X g of Fq into the 2-generated free group 
F = F2 = F(&i,&2), such that the image P(Fq) is freely generated by X g , g G G\{1} and 
the words X g are very "independent" in the sense described below. 

The group H is equal to the quotient F/L, where L is the normal closure of (3(N) in F. 

Notice that whatever homomorphism (3 we choose, it induces a homomorphism 7 : G — > 
H. To make this homomorphism injective, we need the following property: 

L n /3(Fg) = 0(N). 

In fact (3{Fg) satisfies the following much stronger property: 

(*) For any normal subgroup U < (3(Fq) there is a normal subgroup V < F such that 

u = vn(3(F G ). 

The fact that free groups and more generally every non-elementary hyperbolic group 
has plenty of infinitely generated free subgroups with property (*) is interesting in its own 
right, it was the key ingredient in Olshanskii's proof from |27| of the fact that every non- 
elementary hyperbolic group is SQ-universal. 

It turns out that we can make (5 satisfy condition (*) by choosing reduced words X g 
with the following small cancellation condition: 

(**) If Y is a subword of a word X g and \ Y\ > tq\X 9 \ then Y occurs in X g as a subword 
only once, and Y occurs neither in X^ 1 nor in X^ for h 7^ g. 

It is relatively easy to construct an infinite set of words X g in the alphabet {61, 62} which 
satisfies the (**)-condition and has exponential growth, that is the number of different words 
X g of length k grows exponentially as k — > 00. 

Since by condition (D3) the number of elements g G G with 1(g) < k does not exceed 
c k for some constant c, we can choose the set {X g } in such a way that 

lis) <\X g \< di{g) 

for some positive constant d and every g G 

We need to show that the embedding 7 : G — > H has bounded distortion. For this we 
take any element X g of 7(G) and consider the shortest word W in the alphabet {61,^2} 
representing X g in H. The group H is given by the presentation consisting of all relations 
of the form X gi X g2 ...X gn = 1 where gig2--Sn = 1 in G. 

Since X g = W modulo this presentation, we can consider the corresponding van Kampen 
diagram A with boundary label X g W~ l . We can assume that the number of cells in A is 
minimal among all such diagrams. 

The condition (**) implies the following property of van Kampen diagrams over the 
presentation of H. Let IT and H2 be cells in a diagram A having a common edge. Then 
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either any common arc p of the boundaries dUi and <9Ii2 is short comparing to the perimeters 
Pi, P2 of the cells (say, \p\ < min(Pi, P2)), or a subdiagram consisting of III and II2, has 
also a boundary label of the form w = w(X 9l , . . . , X 9n ), i.e. the subdiagram can be replaced 
by one cell. The latter option cannot occur in A because of the minimality of the diagram 



A. Thus A satisfies a small cancellation condition [23|. 

This in turn allows us to prove that the word W is freely equal to a product X^ 1 . . . X^ 1 
for some gj € G with g = g^ 1 . . . g^ 1 (see 1 28] for details). Further, since the cancellations 
in such a product are small, 



|7(<?)I* >(i-|)£l*J- 



By conditions (Dl), (D2), and by the choice of X g , we have: 

h(g)\ H > 0.96 £ %j) = 0.96^%f) > 0.96%). 
Hence 0.961(g) < \j(g) \h < d£(g), so £ is O-equivalent to the length function of G in H. 
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